首页> 外文OA文献 >Group variable selection via convex log-exp-sum penalty with application to a breast cancer survivor study
【2h】

Group variable selection via convex log-exp-sum penalty with application to a breast cancer survivor study

机译:通过凸对数-总和罚分进行群体变量选择并应用于乳腺癌幸存者研究

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In many scientific and engineering applications, covariates are naturally grouped. When the group structures are available among covariates, people are usually interested in identifying both important groups and important variables within the selected groups. Among existing successful group variable selection methods, some methods fail to conduct the within group selection. Some methods are able to conduct both group and within group selection, but the corresponding objective functions are non-convex. Such a non-convexity may require extra numerical effort. In this article, we propose a novel Log-Exp-Sum(LES) penalty for group variable selection. The LES penalty is strictly convex. It can identify important groups as well as select important variables within the group. We develop an efficient group-level coordinate descent algorithm to fit the model. We also derive non-asymptotic error bounds and asymptotic group selection consistency for our method in the high-dimensional setting where the number of covariates can be much larger than the sample size. Numerical results demonstrate the good performance of our method in both variable selection and prediction. We applied the proposed method to an American Cancer Society breast cancer survivor dataset. The findings are clinically meaningful and may help design intervention programs to improve the qualify of life for breast cancer survivors.
机译:在许多科学和工程应用中,协变量是自然分组的。当在协变量中有可用的组结构时,人们通常会对识别重要组和选定组中的重要变量感兴趣。在现有成功的组变量选择方法中,有些方法无法进行组内选择。一些方法既可以进行组选择,也可以进行组内选择,但是相应的目标函数是非凸的。这样的非凸性可能需要额外的数值努力。在本文中,我们为组变量选择提出了一种新的Log-Exp-Sum(LES)惩罚。 LES惩罚严格是凸的。它可以识别重要的组,也可以选择组内的重要变量。我们开发了一种有效的组级坐标下降算法来拟合模型。对于协变量数可能远大于样本量的高维环境,我们还为我们的方法得出了非渐近误差界和渐近群选择一致性。数值结果证明了我们的方法在变量选择和预测方面的良好性能。我们将提出的方法应用于美国癌症协会乳腺癌幸存者数据集。该发现具有临床意义,并可能有助于设计干预计划,以改善乳腺癌幸存者的生活质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号